Category-Guided Visual Question Generation (Student Abstract)

نویسندگان

چکیده

Visual question generation aims to generate high-quality questions related images. Generating based only on images can better reduce labor costs and thus be easily applied. However, their methods tend similar general that fail ask about the specific content of each image scene. In this paper, we propose a category-guided visual model with multiple categories focus different objects in an image. Specifically, our first selects appropriate category relationships among objects. Then, corresponding selected categories. Experiments conducted TDIUC dataset show proposed outperforms existing models terms diversity quality.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Segmentation Guided Attention Networks for Visual Question Answering

In this paper we propose to solve the problem of Visual Question Answering by using a novel segmentation guided attention based network which we call SegAttendNet. We use image segmentation maps, generated by a Fully Convolutional Deep Neural Network to refine our attention maps and use these refined attention maps to make the model focus on the relevant parts of the image to answer a question....

متن کامل

Abstract category learning

Category Learning Atsushi Hashimoto and Haruo Hosoya Department of Computer Science The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan Abstract. Motivated by a neurophysiological experiment on prefrontal cortex, we study a scheme for learning abstract categories. An abstract category represents a set of vectors that are identical to each other modulo substitution, e.g., ’ABAB’, ’BABA’...

متن کامل

Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering

The problem of Visual Question Answering (VQA) requires joint image and language understanding to answer a question about a given photograph. Recent approaches have applied deep image captioning methods based on recurrent LSTM networks to this problem, but have failed to model spatial inference. In this paper, we propose a memory network with spatial attention for the VQA task. Memory networks ...

متن کامل

Improving Student Question Classification

Students in introductory programming classes often articulate their questions and information needs incompletely. Consequently, the automatic classification of student questions to provide automated tutorial responses is a challenging problem. This paper analyzes 411 questions from an introductory Java programming course by reducing the natural language of the questions to a vector space, and t...

متن کامل

Category Generation

People exhibit the ability to imagine new category instances and new categories, with examples ranging from everyday activities like cooking to scientific discovery. This ability, which we call category generation, is not addressed by standard models of category learning, which focus on classifying instances rather than generating them. We develop a probabilistic account of category generation ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i13.26991